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Abstract 

O , Let G = {V, E) be an undirected loopless graph with possible parallel edges and s, t e 

w ■ V. Assume that s is labelled at the initial time step and that every labelled vertex copies 

(~| , its labelling to neighbouring vertices along edges with one labelled endpoint independently 

with probability p in one time step. In this paper, we establish the equivalence between the 
expected s-t first arrival time of the above spread process and the notion of the stochastic 
shortest s-t path. Moreover, we give a short discussion of analytical results on special graphs 
including the complete graph and s-t series-parallel graphs. Finally we propose some lower 
CN ■ bounds for the expected s-t first arrival time. 

Keywords, random processes on graphs, stochastic shortest s-t path, s-t reliability polyno- 
^ ' mial, virus propagation in networks 

(N 

^ \ 1 Introduction 

O 

The spreading of information in networks is a random process. Consider an undirected loopless 
graph that possibly has parallel edges G = {V, E). Let s g F be a chosen vertex that is labelled 
! at time step 0. In addition we assume that that every edge of G that has a labelled endpoint in- 
■ dependently copies the label to the possibly unlabelled endpoint with the given infection prob- 
. 5^ . ability p in one time step. 

Some applications are network based models for virus spread in epidemiology or the analy- 
sis of gossipping across social networks. The analysis of malware propagation in computer net- 
works, such as spread of computer viruses or worms, represents another important application 
in technical networks. 

The epidemic threshold is an interesting measure that characterises the transition from local 
to global spread. Random graphs and small-world networks gained most attention for investi- 
gations in this field (see for instance |T| or fS"!). Certain regularity conditions are necessary in 
order to make the analysis of large or infinite graphs possible. An application for the analysis of 
the outbreak of Severe Acute Respiratory Syndrome (SARS) is presented in L6J . 
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We deal with the following questions in this paper. First we present the problem of finding 
the expected s-t first arrival time using the generating function approach (see Section O. Sec- 
ond we establish in Section |3] the relationship between the spread process and the stochastic 
shortest s-t path problem stated in |9, Chapter 9]). In Section[4]we introduce the s-t spreading 
resistance of a graph. We show that this notion is related to Kulkarni's concept 1 4 1 of the expected 
length of a shortest s-t path, whenever the edge lengths are exponentially distributed random 
variables with intensity p. Subsequently, we establish two reduction techniques for calculating 
the expected first arrival time in s-t series-parallel graphs (see Section^. Due to the computa- 
tional complexity of the spread process, we propose in Section[7]methods for yielding lower and 
upper bounds for the expected s-t first arrival time in terms of the s-t reliability polynomial and 
the distance between s and t. 

2 The spreading process 

All graphs considered in this paper are finite and loopless, but possibly with parallel edges. In 
this section, we present how the spread process propagates in arbitrary graphs. 

2.1 Assumptions 

Let G = {V,E)he a graph and V he the set of labelled vertices in G at the discrete non- 
negative integral time step k. Assume that u € Xk copies the message to an adjacent unlabelled 
vertex v with probability puv = ^ - quv along the edges {u, v} in G. We refer to the numbers puv 
and quv as the infection and noninfection probability from utov. 

The spread is performed simultaneously along all edges that link labelled vertices to unla- 
belled vertices within one unit of time. The copying of labels along the edges is assumed to be 
stochastically independent. In addition the message spread is symmetric, that is for all u,v eV, 
Puv = Pvu- We presuppose that puv > for any {u, v} G E, whereas {u, v} ^ E imphes puv = 0. 

Therefore we can describe the spread process by a homogeneous Markov chain {Xkjken, 
where the state space is a subset of 2^. If G is a connected graph then the labelled vertices 
spread until all vertices of G are labelled; that is, limfc_j.oo Xk = V. 

Let G = {V, E) be a graph and v & V. The open neighbourhood ofv, denoted by N{v), is 
the set of vertices that are adjacent to v in G. For each A c V, the open neighbourhood of A is 
defined by 

N{A) := J iV(^;)\^. 

Furthermore for A,BCV, the A-B edge set in G, denoted by {A, B), is the set of edges of G with 
one endpoint in A and the other endpoint in B. That is, 

{A,B) := {{u,v} eE:u£A,ve B}. 

Definition 1. Let G = {V, E) be a graph and A,BCV with A c B. The transition probability 
Pg{A,B) is the probability that the vertex subset B becomes infected in one time step if the 
vertex subset A is infected beforehand. 
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Figure 1: Transition probability of the spread process from A to -B in G 



Theorem 2. Let G = iV, E) be a graph and A,B <^ V with A c B. The transition probability 
Pg{A, B) is then given by 



pg{a,b)=\{ n n 

v&Ae£{{v},N(A)\B) x£B\A 



n 



Moreover ifG is a simple graph, then 



PGiAB) = ll Yl q,^ Yl 

veAw£N{A)\B xeB\A 



/e{W,A) 



n 'i^y 

y&N{x)r\A 



Proof. Figured] explains the derivation of Pg{A, B) in a simple graph G. There must be no label 
propagation along edges from A to N{A) \ B that are represented as broken lines. This gives the 
first double product. The remaining products correspond to transmissions along solid drawn 
edges. For each vertex x G B \ A, there must be at least one edges that transports the label 
which gives the term in brackets. ■ 

At this point, we denote the transition probability from ^4 to in the graph G as P{A,B), when- 
ever G is known from the context. 

Corollary 3. Let G = {V, E) be a graph and A,BCV with A c B. Furthermore, assume that 
all infection probabilities are identical; that is, puv = p = 1 - g for all {u, v} e E. The transition 
probability from AtoB in G is then given by 

P{A,B) = q\(^'''(^^\^^\ Yl [l-gl(W'^)r 

xeB\A 

Note that P{A, B) can be only nonzero if 5 c N{A) U A holds. 



2.2 First arrival times 

We first describe some general properties of the spread process. In order to simplify the presen- 
tation, we now assume that all infection probabilities are identical. 
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Definition 4. Let G = {V, E) be a graph and X c 1/ be a nonempty vertex subset. The graph 
Gx is obtained by merging the vertex subset X in G. The merged vertex in Gx is denoted by X' , 
where Gx contains an edge {X' , y} for each edge {x, y} ^ EinG with x ^ X and y G y \ X. 

Lemma 5. Let G = {V,E) be a graph and A,X c V. If A' = {A\ X) U {X'} is set, then 
Pg{X,A) = Pg^{{X'},A') holds. 

Proof. Let Ng{X) and Ngx{{X'}) be the corresponding open neighbourhood of X in G and 
{X'} in Gx, then \{X, N{X) \A)\ = \ {{X'}, N{{X'}) \ A)\ holds. The merging operation provides 
\{X, {a})\G = \{{X'}, {a})\Gx for alia e A\X. Consequently, the exponents of q from Corollary 
|3] remain unchanged while transforming G into Gx- ■ 

Thus it is advantageous to work with Gx instead of G since the number of vertices is reduced, 
but the graph Gx rnay not be simple even in the case that G is a simple graph. However, Gx 
can be transformed into a simple graph by replacing all parallel edges joining a pair of vertices 
u,v e V by just one edge {u,v}. That is, two parallel edges namely, e = {u,v} and / = {u,v} 
with infection probabilities pe and pj are replaced by a new edge g with infection probability 
Pg = Pe+Pf — PeP f ■ In this casc we obtain the more general model including edges with different 
probabilities. 

Definition 6. Let G = {V, E) be a graph and A,B c V . Denote by Zab the random variable of the 
first time that all vertices in the set B are infected if the vertex set A is infected at the beginning 
of the spread process. 

The following lemma describes the transition from one state to another in terms of a Markov 
chain. This result will be utilised to establish Theorem [9i In the following c denotes the proper 
subset inclusion. 

Lemma 7. Let G = {V, E) he a graph and A,BCV such that An B c B, then 

Fi{{Zab = n})=Y^ ^(^. C) Pv{{ZcB = n - 1}) (1) 

CDA 

holds for all n > 1. 

Proof. The condition AnB c B implies that there is at least one vertex t £ B that is not yet in- 
fected, when the spread process reaches the vertex subset A. The probability that all vertices in 
B are infected for the first time after n time steps can be divided into disjoint events by consider- 
ing one further time step. In one time step a superset C of A can be infected, where the transition 
probabilities are given by the P{A, C). In order to infect B for the first time after n time steps, 
it is necessary to reach B from C in exactly n — 1 time steps for the first time to account for the 
already elapsed extra time step. ■ 

Definition 8. Let G = {V, E) be a graph and A,BQV. The ordinary generating function of the 
A-B first arrival times is defined as 

^ab{G,z) = Y,^i{{Zab = n})z^. 

n>0 

It clearly follows from the definition of Zab that ^ab{G, z) = 1 holds for the case ofA^B. 
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Theorem 9. The ordinary generating function of the A -B first arrival times satisfies the recurrence 
relation 

^ab{G, z)=Y,^ P{^^ C) <^cb{G, z) 

for all A, B c V with An B c B, where the initial conditions ^cb{G, z) = 1 hold for all C 5 B. 

Proof. The statement follows directly from LemmaH Note that the assumption of An B o B 
implies Fi{{Zab = 0}) = 0, as S cannot be reached from A in zero time steps. ■ 

Observe that the nice triangular structure of the system of linear equations in Theorem HI 
caused by P{A, B) = for all pairs {A, B), whenever A is not a subset of B, permits the practical 
solution of systems with more than 10^ equations. It is now possible to deduce a recurrence 
relation for the expectation of the random variable Zab by using the generating function for 
the A-B first arrival time probabilities. A similiar approach for the later discussed exponential 
model is presented in ID . 

Corollary 10. Let G = (V, E) he a graph and A,BCV with A n B B. The expectation of the 
random variable Zab, denoted by Tab (G), obeys the recurrence equation 

Tab{G) = nZAB] = ^) \Tcb{G) + 1] (2) 

with the initial conditions Tab {G) = I, for any A ^ B. ■ 

In the case that the sets A and B are singletons say A = {s}, B = {t}, we will write Tst{G) 
instead of Ts^g^^^{G). This notation is also used when only one of the two sets is a singleton, e.g. 
TAt{G) means TA{t}{G). The same notation also applies to the random variable Zab{G) and the 
later defined spreading resistance pAt(G). 

3 Stochastic shortest paths 

In this section, we recast the problem of the spread process in a stochastic shortest path prob- 
lem. A nice introduction to solve stochastic shortest path problems in an algebraic way in the 
case of discrete arc lengths assuming finitely many values is found in |9 , Chapter 9] . 

Definition 11. Let G = {V,E) be a graph and s,t G V. Furthermore let Dgt be the random 
variable of the length of a shortest s-t path in G, if the lengths L(e) of the edges e G in G are in- 
dependent geometric random variables with parameter p, where p equals the success (infection) 
probability of the spread process. 

Theorem 12. LetG = {V, E) be a graph ands, t eV, then Dgt = Zgt- 

Proof. For all e g we let L(e) be independent random variables representing the edge 
lengths, that are drawn from a geometric distribution with success probability p = I - q. That is 
for all n > 1, 

VT{{L{e)=n}) = {l-q)q^-\ 

Now for all ^4 c y the random variable Dai denotes the shortest length among all possible 
lengths ofshortestu-t paths in G with ti G A, that is, = m.m.{Dut'- u G A}. Note that whenever 
the target vertex t satisfies t € A,we have Pr{{DAt = 0}) = 1. 



5 



Now for every A V and v £ N{A), let the random variable L{A, v) be the smallest edge 
length among all lengths of edges contained in {A, {v}). That is, 

L{A, v) = min{L(e) : e £ {A, {v})}. 

We observe that 

Ft{{L{A,v) = 1}) = 1 

With the notions of the length L(e) of an edge e and L{A, v), we can restate the random variable 

DAt as 

DAt = min{D,t + Lilu, v}) : {u, v} € {A, N{A))} 
= mm{D^t + L{A, v):ve N{A)}, 

whenever A cv\{t} holds. By defining the event 

A(A, B) := {L{A, v) = 1: V £ B}r\ {L{A, v) > I: v e N{A) \ B} 
for each ACV and B Q N{A),we obtain 

Pr(A(^, B)) = Yl Pr({L(A, v) = 1}) J] Pr({L(A, v) > 1}) = P{A, AuB), 

v€B v£N{A)\B 

which coincides to the transition probability of Section [2j The application of the law of total 
probability on the event {Dai = n} yields 

Pv{{DAt = n})= P{A,AuB)FT{{DAt = n}\A{A,B)). 

BCN{A) 

For each AcV\{t} and B c N{A), we have 

^\{A,B) 



Vv{{DAt = n} I A(^, B)) = Vr[\ min D^t + L{A, v) = n 

Now the conditional probability on the right hand side can be restated by reducing for all v £ 
N{A) the realisation of the random variables L{A, v) by one and asking for the probability of a 
shortest A-t path of length n - 1. That is, 

'\'{A,B) 



Vr{{DAt = n}\A{A,B)) = Vv[{ min D^t + L{A,v) = n - 1 

\ lveN{A) 

with 

A' {A, B) = {L(A, v) =():v £ B]r\ {L{A, v)>l:v£ N{A) \ B}. 

Now all f G S with L{A, v) = can be included in the set A, as edge lengths of size zero do not 
contribute to the length of a shortest A-t path, whereas for each v g N{A) \ B the condition 
L{A, v) > lis always satisfied. Thus the condition A' {A, B) can be omitted and 

Vr{{DAt = n} I A(^, B)) = VT{{DAyjB,t = n-l}) 
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follows. Hence, the recurrence relation 

Pr{{DAt = n})= P{A,AuB)Fi{{DAuB,t = n-l}) 

BCN{A) 

= Y,nAC)Fri{Dct = n-l}), 

CDA 

is valid for allAQV \ {t} with initial conditions Pr{Dct = 0) = 1 for all C c y with t e C. This 
recurrence relation coincides with O, whenever Zai is replaced by Dai- Hence, Dai = Zai is 
valid for all yl c y and so Dst = Zst follows. ■ 

An important consequence of the above correspondence is the symmetry of the s-t first arrival 
time probabilities, which is only obvious in the view of the stochastic shortest path formulation 
of the spread process. 

Corollary 13. LetG = {V, E) be a graph and s,t then Zgt = Zts- ■ 



4 Spreading resistance 

The spread process is connected to a continuous time Markov chain, which was already exam- 
ined by Kulkarni and Corea in IHEl . 

Definition 14. Let G = (F, E) be a connected graph with A c y and t G V. Let TAt{G) be the 
expected A-t first arrival time in G. The number pAt{G) defined by 

PAt{G) = \im{'^-q)TAt{G), 
whenever this limit exists, is called the A-t spreading resistance ofG. 

Theorem 15. Let G = {V, E) be a graph then the A-t spreading resistance obeys the recurrence 
relation 



PAt{G) 



1 



l(AiV(^))l 



1+ \{A,{c})\PA^{c},t{G) 

c£N{A) 



for all A c V andt e V with {A,N{A)) / and initial conditions pctiG) = for all C c V 
satisfying t e G. 

Proof. Observe that the system of linear equations for the expected A-t first arrival times that 
is obtained from with the initial condition Tct{G) = 0, for all C c y and t e C can be restated 
as 



TAt{G) 



1 



l-P{A,A) 



l+Y,P{A,G)Tct{G) 



COA 



As a consequence, the limit value of (1 - TAt{G) yields to 



lim(l - q) TAt{G) = lim ^ ^ 

q^l q^l\ — F[A,A) 



l+Y,nAG)Tct{G) 



C^A 



(3) 
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as q tends to 1. Now consider the two limits from the right-hand side of ^ 



Li 



lim 



l-q 



and L2 = lim 



P{A,C) 



q^l 1 - P{A, A) ^ q^ll- P{A, A) ' 

By applying Corollary |3] and L'Hopital's rule to the right-hand side of Li, one finds 

l-Q 1 



Li = lim — 

q^l 1 



\(A,N{A))\ \{A,N{A))\ 



as \{A,N{A))\ > 1. Similarly, 



lim 

<?->! 1 



MA,NiA)\C)\ 



q\{A,N{A))\ 
l(A{c})| 



n 



J{A,{c})| 



lim 



n [> 



l(AM)l 



(4) 



Now suppose |C \ ^1 > 2. Then for every c e C\A there is a ^; 7^ c that contributes the factor 
1 — to the product in Hence the limit vanishes for every c G C \ yl. In the case of 

C = Au{c} with c G A^(A) one readily finds 

\{A,{c})\ 



^ \{A,N{A))\ 

Now assume that PAu{c},ti^) exists for each c G A^(^) then © becomes 



PAtiG) 



1 



1+ 5] \{A{c})\PAu{c}AG) 

ceN{A) 



\iA,N{A))\ 

Corollary 16. Let G = {V, E) be a connected graph with A cv and t eV, then the limit 



PAtiG) = lim(l - q)TAtiG) 

always exists. ■ 

Definition 17. Let G = {V, E) be a graph. The exponential spreading model in G is a spread 
process such that every edge e G is weighted with uj{e) and {uj{e) : e G -E} is a collection of 
independent exponential random variables with intensity p. 

The following result, due to Kulkarni, shows the connection between pAt{G) and the expo- 
nential spreading model in G. 

Theorem 18 (Kulkarni |4|). LetG = {V,E) beagraph with A cv,teV, and{A,N{A)) / 0. // 
TAt{G) is the expected length of a shortest A-t path of the exponential spreading model in G, then 
TAt{G) satisfies the recurrence relation 



TAt{G) 



1 



U,N{A))\ 



l(A{c})|r^u{c},(G) 



ceN{A) 

with the initial condition Tct{G) = Q for all C c V witht g C. ■ 

The substitution of (1 — q)TAt{G) = PAt{G) transforms Kulkarni's result into the recursive defini- 
tion of pAt{G). This yields an interpretation of the A-t spreading resistance pAt{G). 
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5 Special graphs 



This section illustrates the problem of the spread process discussed in Sections[2]and|4]on graphs 
with a special structure. The simplest is a path P„ of length n. If s and t are the end vertices 
of Pn and p is the infection probability assigned to every edges in P„, then Tst{Pn) = n/p and 
Pst{Pn) = n. This result is useful when deriving some simple upper bounds as seen in Chapter[71 
Another trivial example is a tree, which follows from the fact that every two (distinct) vertices in 
a tree are joined by a unique path of nonzero length. 

Theoreml9. LetH{mi,m2, . . . ,mn) = {V,E) be the graph consisting of n parallel paths of lengths 
nii with 1 <i <n between the vertices s andt. Then itss-t spreading resistance is given by 

Pst{H{mi, 1712,..., rUn)) = y2y2^Ti\ ■ ■ ■ )' 

^^nJ + l \li,l2,...,lnj 

where the inner sum runs for all nonnegative integers ii, 12, ...,in such thatj = ii + 12 + ■ ■ ■ + in 
andik < iTik holds for all k with k = 1,2, . . . ,n. 

Proof. Suppose that for all j = 1, 2, . . . , mi, the weights uj{eij) of the edges in the i-th path of 
G = H{mi,m2, . . . , mn) are independent exponential random variables with intensity p. Hence 
the realisation {uj{eii),uj{ei2), . . . ,uj{eimj) of waiting times for the edges of the i-th path in G 
corresponds to the sum of the random variables uj{eij), j = 1, 2, . . . , mj. This implies that Yi = 
uj{eii) + uj{ei2) + . . . + uj{eimi) follows an Erlang-mj distribution with intensity p; that is, 

Pr({y.<n) = l-e-*^^, 

for each i = 1,2, . . . ,n. Now define the random variable Y = m.m.{Yi : i = 1, 2, . . . , n}, then 

n 

Pr({y<t}) = l-J]Pr({y, >t}). 

i=l 

Because y is a continuous random variable that takes nonnegative real values, it follows that its 
expectation obeys 

/■oo 

E[y] = / Pr({y > t}) dt. 

Jo 

With m' = Yli^iimi - 1), we obtain 

Jo f^Q^ j! n!»2!---«n! pf^^^nJ+^\ii,t2,...,iJ 

The inner sum in the above equation runs over all nonnegative integers ik with k = 1,2, ... ,n 
satisfying ii + 12 + . . . + in = j and 4 < m^. for all k with 1 < k <n. The desired claim follows by 
multiplying E[Y]hy p = 1 — q. ■ 

Davis and Prieditis |3| considered the expected .s-t first arrival time for the complete graph 
with respect to the exponential model. The same analysis can be also accomplished in the case 
of the spreading process, which is the assertion given by the following result. 
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Theorem 20. LetKn = (V, E) be the complete graph with n>2 vertices, ACV \ {t}, \A\ = i and 
Ti := TAt{Kn). ThenTi satisfies the recurrence relation 

j=i+i \ J ^ J 
with the initial condition Tn-i = 1/(1 — q^^^)- 

Proof. Note that in the case of the complete graph K„ the transition probabilities P{A, B) do 
not depend on the sets A and B themselves, but on their cardinality. Assume therefore that A is 
an i element set and that B is an j element superset of A, then we find 

P{A,B)=q'^''~^^[l-(iY'" . (5) 

Hence, the A-t first arrival time of Kn is obtained by plugging ||5ll in the recurrence formula of 
the A-t first arrival times stated in I©. That is with A c y \ {t} and \A\ = j, we have 

ACCCV'\{t} 

=i+x;(^T^-\^(-)[i-,r-T,. (6) 

The result then follows by factoring out Tj in fH) . ■ 

Consequently, the solution of the recurrence relation in Theorem [20] gives us the expected 
s-t first arrival time of the Kn by setting i = 1. In effect, the s-t spreading resistance of Kn with 
n>2 yields to the ratio of the (n — l)-st harmonic number and n — 1. This result was deduced in 
a different fashion in the work of David and Prieditis 1 3 1 . 
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6 Reduction techniques on series -parallel graphs 

In the case of s-t serial parallel graphs, we present an algebraic approach that utihses the Hadamard 
product of formal power series. The approach presented here is viewed as a special case of 
Shier's |9| general algebraic approach for the stochastic shortest path problem under the as- 
sumption that all s-t paths in G are edge disjoint. 

Theorem 21 (Series reduction technique). Let G = {V,E) be a connected graph with an artic- 
ulation a £ V such that there are two subgraphs Gi = {Vi,Ei) and G2 = (V2, E2) ofG satisfying 
G1UG2 = G andGinG2 = ({a},0). Lets G Vi\{a} andt G V2\{a}, then the following equations 
are true: 

(a) ^st{G,z) = ^sa{Guz)^at{G2,z) 
ib) Tst{G)=Tsa{Gi)+Tat{G2). 

Proof. Let Pr{{ZstiG) = n}) be the probability of the s-t first arrival time with respect to a 
spread process in G at n > Otime steps. Moreover, denote by Pr({Zsa(Gi) = n}) andPr({Zat(G2) = 



10 



n)}) the s-a and a-t first arrival time probabilities in Gi and G2 in n > time steps, respectively. 
Then we obtain 

71 

Pr{{Z,t{G) = n}) = j;Pr({Z,,(Gi) = k}) Pr({Z,t(G2) = n - k}) 

k=0 

Correspondingly, let ^st{G, z) be the ordinary generating function for the s-t first arrival time 
probability in G. Similarly, the functions <l>sa(Gi, z) and ^at{G2-, z) denote the ordinary gener- 
ating functions for the first arrival time that the contact process reaches a in Gi and t in G2, 
respectively. Hence 

(G, z)=Y, Pr({^st(G) = = ^^a (Gi, z) ^at {G2, z) . 

n>0 

follows. By applying the formal differentiation with respect to the indeterminate z and then 
setting z = I gives the desired expression Tst{G) = Tsa{Gi) + Tat{G2). ■ 

In order to establish the parallel reduction technique (Theorem[24]l, we need to introduce the 
notion of Hadamard multiplication in the ring C[[Z]] of all formal power series with complex- 
valued coefficients over the indeterminate z. 

Definition 22. Let A{z) = X]i>o"*-^* ^^"^ -^(^) = I^i>o t)e arbitrary formal power series in 
C[[Z]]. The Hadamard product of A{z) and B{z) is defined as 

{A B){z) := A{z) B{z) = ^ aihzK 

i>0 

In other words, the resulting formal power series A{z) B{z) is obtained by termwise multipli- 
cation of A{z) and B{z). It is clear that the Hadamard product is well-defined and is closed in 
C[[Z]] as presented in 1 10 1 . Together with the usual polynomial addition, polynomial multiplica- 
tion, scalar multiplication and the Hadamard product, C[[Z]] forms a commutative algebra over 
the field C. In addition, the geometric series 



J{z) = Y.z'^ = - 



k>0 



acts as the identity element of C[[Z]] under Hadamard multiplication. 

Let r be a positive integer. We call the formal power series J^iz) to be the r -geometric series, 
which is defined as 



1 Y ^ fr + k-l 



A;>0 



k 



k 



The following lemma presents a closed rational expression after pointwise multiplying m- and 
n-geometric series. 

Lemma 23. Letm and n be positive integers with m> n, then for any complex numbers a and b, 

J'^iaz) r{bz) = J'"+''-\abz) ^ ^ ^ ^ ^ 
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Proof. Suppose a and b are nonzero complex numbers. Assume further that m and n are posi- 
tive integers with m > n. Let F{z) = J2k>o fi^)^^ be the resulting formal power series obtained 
by taking the Hadamard product of J^{az) and J^'{bz). Then for each e N, the coefficient of z'' 
in F{z) yields 



We see that 



fik) 



n + k — 1 
k 



m + k — l\ /n + k — 1 



k 



{k + 1 



in-l 



(n-1)! 



k 



n-1 

E 

i=0 



{abf 



n — 1\ k- 



(7) 



where the rightmost part of is first obtained by treating {k + l)""^/(n - 1)! as a polynomial in 
k and then applying the Newton's forward difference equation to the said polynomial. Using 
and setting w = abz in F{z), we now obtain 



n-1 



Oberve that we can factor J"+™~^(i(;) from the sum in ID. This gives us 

Fiw) = r+^-\w) h-^y^^ + '-^V^-'- ^\ (_i)i^^+i 

j=o j=o V*/V ^ /V ^ / 



(8) 



n-1 j 



The identity 



j=0 4 = 



1 



n — 1\ fm + j 



1\ fn — j + i — 1 



n — J + I 
i 



1 



n — \ \ fm + j — i 
J-iJ\ j-i 
holds for each i with < i < j. In effect, F{w) in Q becomes 



]\fm + j-i-l 
i I \ m — 1 



F(w) = J 



n-1 

"ME 

j=0 



n — 1 
3 



E 

.j=0 



3\(m + j- i 
i I \ m — 1 



(-ir 



n — 1 
3 



(9) 



(10) 



We interpret the inner sum of lITOl l in a combinatorial way. Let Y be an {m + j - 1) -element 
set. Let / c y be a j -element subset of marked elements in Y. For each c /, we define 5a to 
be the collection of {m — 1) -element subsets of y such that all the elements of A are not included 
in the said subsets. In view of the principle of inclusion-exclusion, we have 



ACI 



E(-i) 



^1 



ACI 



aeA 



i=0 



j\ fm + j - i 
i / \ m — 1 



This gives us the number of ways we can draw an (m — 1) -element set from Y such that all the 
marked elements of / are included. In turn, this corresponds to the number of ways of forming 
an j- element subset of an arbitrary (m - 1) -element set. ■ 
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Theorem 24 (Parallel reduction technique). LetG = {V,E) be a graph and {s,t} be a separating 
vertex pair ofG such that H and K are two connected nontrivial graphs with H u K = G and 

HnK = ({s,t},0), then 



^stiG, z) = ^st{H, z) + ^st{K, z)-{l- z) 



^st{H,z)_^^stiK,z) 



1 



1 



Proof. Let Zst{G) be the random variable ofthes-t first arrival time in G, and ZstiH) and Zst{K) 
be the random variables of the s-t first arrival times in H and K, respectively. As the events 
{Zst{H) > n} and {Zst{K) > n} are independent, we have 

1 - Pr {{Zst{G) < n}) = (1 - Pr {{Zst{H) < n})) (1 - Pr {{Zst{K) < n})) . 

Multiplying both sides of the above equation with and summation over all n > yields 

l-^stiG,z) _ 1 <^stiH,z) '^st{K,z) ^ ^stiH,z) ^ ^st{K,z) 
I — z 1 — z 1 — z 1 — z 1 — z 1 — z' 

which leads after rearranging terms to the desired result. ■ 

We apply the parallel reduction for the series-parallel graph G with s and t as its start and 
terminal vertices such that {s,t} forms a vertex separating pair. The subsequent result deals 
with the case that one of the two subgraphs of G is a path of length r > 1. 

Corollary 25. Let G = {V, E) be a graph and {s, t} a separating vertex pair ofG, furthermore 
assume thatH andK are subgraphs ofG with G = Hu K andH n K = {{s,t}, 0) and thatK is a 
path of length r connecting s andt. Then the following equations hold: 



(a) ^st{G,z) 



[b] Tst{G) 



p z 



(1 - qzy 



rq 



r-l 



l-z d 
(r - 1)! 



T— 1 



r-l 



r-l 



X ''—I jr— 1 

p\ d 



<l>st{H, qz) 
1 — qz 

'-^^st{H,qz) 
1 — qz 



{l-qY+^ (r-l)! \q) dz""-^ 

Proof. For the first arrival times of the path K of length r in G, one finds 

1 



*..('^-.-)=e(::;) 

n>r ^ ^ 



v r 

p z 



(1 - qzy 



(11) 



By TheoremEU the ordinary generating function for the first arrival time of G yields 



^st {G,z) = ^st {H,z) + 



<^st {H,z) + 



V V 

p z 



p z 



(1 - qzy 

r T 

p z 

(1 - qzy 

r-l 



+(^-1) 



+(^-1) 



^st {H,z) 
l-z 

^st {H,z) 







p z 



(1 - z) (1 - qzy 

r-l 



(1 - qzy 

r r 

p z 

(1 - qzy 

T r 

p z 

(1 - qzy 



{l-z) 



1 - z 
^st {H,z 







1 



1 



z 



(1 - qzY 



+ 1^ 



r-l 



(l-z) 



1 - z 

1 

(1^ 







1 







(1 - qzY 
<^st {H, qz) 



r-l 



l-z d 
(r - ly.Jz 



r-l 



r-l 



,r-l 



1 — qz 

'^st {H, qz) 
1 — qz 
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After evaluating the formal derivative at z = 1 we obtain 



Tst{G) 



rq 



1 



P 



r-1 



z-- ^'^st{H,qz) 
1 — qz 



z=l 



Another instance of the parallel reduction technique is given by two parallel paths of lengths 
n and m between s and t. 

Corollary 26. LetG = {V,E) be the graph that consists of two parallel paths of lengths n > 1 and 
m>l between the vertices s,t where m > n. Then 



^st{G,z) = l 



P 



n+m—2 



1 - Z 



n-1 



(1 - gS^^m+n- 



Proof. Let ^st{Pi, z) be the ordinary generating function for the s-t first arrival time in a path 
of length^. Then the ratio (^st{Pi-, z)/{l - z) becomes 



^st{Pl,z) 1 



l-z 



1 - z 



p 



[I - qzY 



by Equation lUT} and the application of partial fraction decomposition. With £ = n and £ = m,it 
follows from the parallel reduction technique that 



<^st{G,z) = l-{l-z){^ 



n+m—2 



1 







1 



(1 - z)" (1 - g^z)™ 
The desired result now follows by applying Lemma|23]to QD. 



(12) 



7 Bounds for the expected first arrival time 

As a result of the exponentially increasing number of equations in Corollary [TOl the exact com- 
putation of the expected s-t first arrival times can be performed in reasonable time only in small 
networks. Hence, computational feasible bounds for Tst{G) are of great interest. 

Theorem 27. Let G = {V, E) be a graph with s,t ^ V and H = {V, F) with F c E being a 
subgraph of G, thenTst{H) > Tst{G). 

Proof. Every removal of an edge eeEofG causes an increase in the expected s-t first arrival 
time. ■ 

By considering a shortest s-t path as a subgraph of G one gets the following corollary giving 
an upper bound, that is not very tight in general. 

Corollary 28. LetG = {V, E) be a graph and s,t ^V. Ifd{s, t) is the distance ofs andt in G, then 
the inequality 

mt{G)] < ^ 

always holds. ■ 
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A known lower bound for Tst{G) was presented by Lyons (5] and is connected to the ideas pre- 
sented in |7|. That is, if Resst(G) is the electrical resistance between the vertices s and t in a given 
connected graph G, where the edges are assumed to have unit resistance, then 



ResstjG) 
1-q 



< Tst{G). 



A lower bound for Tst{G) can be found by considering the s-t reliability polynomial of G. 

Definition 29. LetG= {V,E) be a connected graph with m edges and i2sf(G,g) = CQ+ciq+C2q^ + 
... + Cmq"^ the s-t rehability polynomial of G, i.e. the probability that there is at least one intact 
s-t path in G, if the edges of G are failing independently with probability q. The s-t insertion 
probability, denoted by Rst{G, q), is then defined as 



TiieoremSO. LetG = {V,E) be a graph and s,t e V. Ifd{s,t) is the distance between s andt inG, 
then 



Proof. For each edge e e E, let L(e) be independent geometric random variables, with non 
infection probabilities q, representing the edge lengths. Denote now by Gt = {V, Ek) with k = 
0, 1, 2, ... a discrete random process that gives an infinite sequence of subgraphs G^ of G, where 
e € E'fc if and only if the event {L(e) < k} occurs. Moreover, denote by 1st the random variable of 
the smallest k, so that Gk contains an s-t path. 
Now the following inequality holds 



for all A; > 0. In order to prove this estimate, we assume an elementary event E in 
{Zst < d{s,t) + A; - 1}, e.g. a realisation of edge lengths, so that the length of a shortest s-t 
path is smaller or equal than d{s, t) + k - l.ln order that E occurs there must be at least one s-t 
path in G, so that the sum of the edge lengths of the path is smaller or equal than d{s, t) + k - I. 
Therefore all edge lengths of this s-t path must be smaller than k, as if there would be one edge 
length in this path that is greater than k + I, then the sum of the path lengths would exceed 
(/c + 1) + {d{s, t) - 1), which contradicts the assumption Zgt < d{s, t) -\- k - 1. As all edge lengths 
are smaller than k, the event E is by definition also in {1st < k}, i.e. Gk must also contains this 
s-t path. Furthermore the probabilities for the event E coincide in both probability spaces, so 
that ini follows. 

Due to JTSl l. we find that the expectation of the random variable 1st is smaller than the expec- 
tation of the random variable Zst - d{s, t) + 1. Hence, 



and this gives d{s, t) - 1 + E [1st] as a lower bound for Tst{G). 

In order to show the relationship between E[Ist] and the s-t reliability polynomial, consider 
the edge e = {u,v} and the event {L{e) < k}. We easily find that Pr({L(e) < k}) = I — q^, 
whenever the non insertion probability is q. Assume now that there are k parallel edges between 



m 




d{s,t)-l + Rst{G,q) <Tst{G). 



Pr {{1st < k}) > Pr {{Zst < d{s, t) + k - 1}) 



(13) 



E [1st] < E [Zst - d{s, t) + l]=E [Zst] - d{s, t) + 1 
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u and V, that are failing independently with probability q, then the probability that there is at 
least one non failing edge is also given by 1 - g*^. Therefore, the probability for the event {1st > k} 
coincides with the probability of the event that there is no s-t path in the multigraph G{k), where 
G{k) emerges from G by replacing each edge in G with k parallel edges. Equivalently, 

Pr({/,t > A:}) = 1 - Rst{G{k),q) = 1 - R^tiG, q'') 

and we can write 

E [Ist] = Y.k [RstiG, g^ . . . , g'^) - Rst{G, q^-\ q^-^) . 

k>l 

Now suppose that the s-t reliability polynomial is expressed as Rst{G, q) = X^I^Lo ^^^^ 

m m 

E [1st] = E E ^ - «'^'"'^) = E ^ = 9) 

i=0 k>l i=l ^ 

and hence, d{s, t) - 1 + Rst{G, q) < Tst{G). ■ 

8 Conclusion 

We have shown that there is a one-to-one correspondence of the spread process and the stochas- 
tic shortest path problem. Furthermore, it was shown that this process can be well-described by 
the exponential model, when the infection probability p tends to zero. Several closed formu- 
lae or at least efficient calculation schemes for special graphs, such as complete graphs, parallel 
paths and s-t series -parallel graphs are presented. Finally some ideas for bounds are proposed 
that utilise the well-known concept of the s-t reliability polynomial, which can be efficiently 
computed for graphs with bounded tree-width. 

The authors conjecture the following bound for the s-t first arrival time, that behaves good 
for values of q, that are close to 1, i.e. linig^i Tst{G) / Tst{G) = 1. 

Conjecture 31. Let G = {V, E) be a graph, s,t ^ V and Tst{G), Tst{G) be the expected s-t first 
arrival times of the spread process and the exponential model, respectively. In this case Tst{G) < 
Tst{G) holds for all < g < 1. 
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